Qingyang Wu

Search Your Block Floating Point Scales!

May 12, 2026

Introspective Diffusion Language Models

Apr 13, 2026

Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Apr 09, 2026

$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

Mar 04, 2026

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

Feb 06, 2026

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Dec 31, 2025

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

Nov 17, 2025

Data Diversification Methods In Alignment Enhance Math Performance In LLMs

Jul 02, 2025

Disentangling Reasoning and Knowledge in Medical Large Language Models

May 16, 2025

How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?

Apr 19, 2025